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In this paper we propose some novel path planning strategies for a double integrator with bounded 
velocity and bounded control inputs. First, we study the following version of the Traveling Salesperson 
Problem (TSP): given a set of points in Mf^, find the fastest tour over the point set for a double integrator. 
We first give asymptotic bounds on the time taken to complete such a tour in the worst-case. Then, we 



unvisited targets does not diverge over time? If such stable policies exist, what is the minimum wait 



algorithms give identical performances for a particular nonholonomic vehicle, Dubins vehicle. 



^ : I. INTRODUCTION 

H 

5r , The Traveling Salesperson Problem (TSP) with its variations is one of the most widely known 

combinatorial optimization problems. While extensively studied in the literature, these problems 
continue to attract great interest from a wide range of fields, including Operations Research, 
Mathematics and Computer Science. The Euclidean TSP (ETSP) [1], [2] is formulated as follows: 
given a finite point set P in M'^ for d E N, find the minimum-length closed path through all points 
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in P. It is quite natural to formulate this problem in the context of other dynamical vehicles. 
The focus of this paper is the analysis of the TSP for a vehicle with double integrator dynamics 
or simply a double integrator; we shall refer to it as DITSP. Specifically, DITSP will involve 
finding the fastest tour for a double integrator through a set of points. 

Exact algorithms, heuristics and polynomial-time constant factor approximation algorithms are 
available for the Euclidean TSP, see [3], [4], [5]. However, unlike most other variations of the 
TSP, it is believed that the DITSP cannot be formulated as a problem on a finite-dimensional 
graph, thus preventing the use of well-established tools in combinatorial optimization. On the 
other hand, it is reasonable to expect that exploiting the geometric structure of feasible paths for 
a double integrator, one can gain insight into the nature of the solution, and possibly provide 
polynomial-time approximation algorithms. 

The motivation to study the DITSP arises in robotics and uninhabited aerial vehicles (UAVs) 
applications. In particular, we envision applying our algorithm to the setting of an UAV monitor- 
ing a collection of spatially distributed points of interest. Additionally, from a purely scientific 
viewpoint, it is of general interest to bring together the work on dynamical vehicles and that on 
TSP. UAV applications also motivate us to study the Dynamic Traveling Repairperson Problem 
(DTRP), in which the aerial vehicle is required to visit a dynamically generated set of targets. 
This problem was introduced by Bertsimas and van Ryzin in [6] and then decentralized policies 
achieving the same performances were proposed in [7]. Variants of these problems have attracted 
much attention recently [7], [8], [9], [10], [11]. However, as with the TSP, the study of DTRP 
in conjunction with vehicle dynamics has eluded attention from the research community. 

The contributions of this paper are threefold. First, we analyze the minimum time to traverse 
DITSP in for d E N. We show that the minimum time to traverse DITSP belongs to 0(ra'^~^) 
and in the worst case, it also belongs' to Q{n^~^). Second, we study the stochastic DITSP, i.e., 
the problem of finding the fastest tour through a set of target points that are uniformly randomly 
generated. We show that the minimum time to traverse the tour for the stochastic DITSP belongs 
to r2(n^/'^) in and Q(n'^/5) in M"^. Drawing inspiration from our earlier work [12], we propose 
two novel algorithms for the stochastic DITSP: the RECURSIVE Bead-Tiling Algorithm 

'For /,p : N ^ R, we say that / e 0{g) (respectively, / G ^{g)) if tlrere exist Nq G N and fc G R+ suclr tiiat 
\fiN)\ < k\g{N)\ for all N > No (respectively, \f{N)\ > k\g{N)\ for all N > No). If / G 0{g) and / G n{g), then we use 
the notation / G 6(g). 
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for and the Recursive Cylinder-Covering Algorithm for M^. We prove that these 
algorithms provide a constant-factor approximation to the optimal DITSP solution with high 
probability. Third, we propose two algorithms for the DTRP in the heavy load case based on the 
fixed-resolution versions of the corresponding algorithms for stochastic DITSP. We show that 
the performance guarantees for the stochastic DITSP translate into stability guarantees for the 
average performance of the DTRP problem for a double integrator. Specifically, the performances 
of the algorithms for the DTRP are within a constant factor of the optimal policies. We contend 
that the successful application to the DTRP problem does indeed demonstrate the significance 
of the DITSP problem from a control viewpoint. As a final minor contribution, we also show 
that the results obtained for stochastic DITSP carry over to the stochastic TSP for the Dubins 
vehicle, i.e., for a nonholonomic vehicle moving along paths with bounded curvature, without 
reversing direction. 

This work completes the generalization of the known combinatorial results on the ETSP 
and DTRP (applicable to systems with single integrator dynamics) to double integrators and 
Dubins vehicle models. It is interesting to compare our results with the setting where the vehicle 
is modeled by a single integrator; this setting corresponds to the so-called Euclidean case in 
combinatorial optimization. The results are summarized as follows: 





Single 
integrator 


Double 
integrator 


Dubins 
vehicle 


Min. time for 

TSP tour 
(worst-case) 


e(n^-3) [2] 


Q(ni-3), 
O(ni-ii) 


0(n) [13] 
(d = 2,3) 


Exp. min. time 
for TSP tour 
(stochastic) 


e{n^-i) [2] 


w.h.p. 

(d = 2, 3) 


e(ni-2db) 

w.h.p. 

(d = 2, 3) 


System time 
for DTRP 


0(A'^-i) [6] 
(d=l) 


e(A2('i-i)) 

(d = 2, 3) 


e(A2('i-i)) 

(d = 2, 3) 



Remarkably, the differences between these various bounds for the TSP play a crucial role 
when studying the DTRP problem; e.g., stable policies exist only when the minimum time for 
traversing the TSP tour grows strictly sub-linearly with n. For the DTRP problem we propose 
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novel policies and show their stability for a uniform target-generation process with intensity A. It 
is clear from the table that motion constraints make the system much more sensitive to increases 
in the target generation rate A. 

11. Setup and worst-case DITSP 
For (i e N, consider a vehicle with double integrator dynamics: 

Pit) = u{t), \\u{t)\\ < rct„ \\p{t)\\ < r,ei, (1) 

where p E M.'^ and u E are the position and control input of the vehicle, rvei E IR+ and 
Tctr E IR+ are the bounds on the attainable speed and control inputs. Let Q C M*^ be a unit 
hypercube. Let P = {qi, . . . , g„} be a set of n points in Q and Vn be the collection of all point 
sets P C Q with cardinality n. Let ETSP(P) denote the cost of the Euclidean TSP over P and 
let DITSP(P) denote the cost of the TSP for double integrator over P, i.e., the time taken to 
traverse the fastest closed path for a double integrator through all points in P. We assume r^ei 
and Tcti to be constant and we study the dependence of DITSP: Vn IR+ on n. Without loss of 
generality, we assume the vehicle starts traversing the TSP tour at t = with initial position qi. 



Lemma 2.1: (Worst-case Lower Bound on the TSP for Double Integrator) For rvei,Tctr ^ 1^+ 
and (i e M, there exists a point set P G "P^ in Q C M'' such that DITSP(P) belongs to Vl{n^^^). 

Proof: We consider the class of point sets that give rise to the worst case scenario for the 
ETSP; we refer the reader to [2]. It suffice to note that, for such a point set of cardinality n 
in W'-, the minimum distance between any two points belongs to VL{n~^). The minimum time 
required for a double integrator with initial speed v to go from one point to another at a distance 
5 is lower bounded by ^ [v/rctrY + 2(5/rctr) — v/vctx- However, v < r^^i and for the point set 
under consideration, S belongs to This implies that the minimum time required for a 

double integrator to travel between two points of the given point set belongs to Hence, 
the minimum time required for the vehicle to complete the tour over this point set belongs to 
nQ{n^^), i.e., Q{n^~^). ■ 

We now propose a simple strategy for the DITSP and analyze its performance. The STOP- 
GO-STOP strategy can be described as follows: The vehicle visits the points in the same order 
as in the optimal ETSP tour over the same set of points. Between any pair of points, the vehicle 



February 1, 2008 



DRAFT 



5 



Starts at the initial point at rest, follows the shortest-time path to reach the final point with zero 
velocity. 

Theorem 2.2: (Upper Bound on the TSP for Double Integrator) For any point set P EVn ^ti 
Q C and r^^, > 0, r^d > and d G N, DITSP(P) belongs to 0{n^-^. 

Proof: Without any loss of generality, let (gi, . . . , gi) be the optimal order of points for 
the Euclidean TSP over P. For 1 < i < n — 1, let 5i = \\qi — qi+i\\ and 5„ = ||g„ — If Si 
is the distance between a set of points, then the time ti required to traverse that distance by a 
double integrator following the STOP-GO-STOP strategy is given by: 

if 6i < ^, 

^ + otherwise. 

Let I={l<i<n\5i< r^gi/rctr} and X'^ = {1, . . . , n} \ X. Also, let nj be the cardinality 
of the set X and let njc = n — nj. Therefore, an upper bound on the minimum time taken to 
complete the tour as obtained from this strategy is 



o 1 

DITSP(P) < V = V + V = 5^ + ^ + — V 5, 

i=l ieX iel'^ V cn .^j ctr vei .^j^ 

< —=2_^V0i + njcl \ j 



. / V ' '"j.^ \ I / ) 

'"ctr ^ '"ctr '"vel ^ 

where diam(Q) is the length of the largest segment lying completely inside Q. From the well 
known upper bound [2] on the tour length of optimal ETSP, there exists a constant /? ( Q) such 
that Xliej ^ J2^=i ^ P{Q)'n'^^'^. Hence an upper bound on the term Xliex in eqn. Q 
can be obtained by solving the following optimization problem: 

maximize y^, subj. to Sj < l3{Q)n^~d. 

By employing the method of Lagrange multipliers, one can see that the maximum is achieved 



when 6i = p{Q)^^-^ Vi G X. Hence Y^iei < \/ P{Q)\I rijin^'^ . Substituting this in eqn. 
©, we get that 



^ 2^(3{Q) r- ^-^ ^ r-' ^ diam(Q)x 
Dlibr(P) < — — yjnjn^ 2d -|- njc ( 1 1. 

V^'ctr ^''ctr ''vel ' 



(3) 

However, rij < n and Lemma |231 implies that rijc belongs to 0{n^~^). Incorporating these facts 
into eqn. Q, one arrives at the final result. ■ 
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The above theorem relies on the following key result. 

Lemma 2.3: Given any point set P G P„ in Q C W^, if {qi,q2, ■ ■ ■ ,(ln,Qi) is the order of 
points for the optimal ETSP tour over P, then for any t] E M+, the cardinality of the set 
{li G P I hi - Qi+iW > V} belongs to 0{n^-^). 

Proof: By contradiction, assume there exists f] E M+ such that the cardinality of {pi E 
P I Wli > v} belongs to for some e > 0. This implies that ETSP(P) belongs 

to f/ X n{n^-2+') = n{n^-i+'). However, we know from [2] that ETSP(P) E 0{n^-^). U 

III. The stochastic DITSP 

The results in the previous section showed that based on a simple strategy, the STOP-GO- 
STOP strategy, we are already guaranteed to have sublinear cost for the DITSP when the point 
sets are considered on an individual basis. However, it is reasonable to argue that there might be 
better algorithms when one is concerned with average performance. In particular, one can expect 
that when n target points are stochastically generated in Q according to a uniform probability 
distribution function, the cost of DITSP should be lower than the one given by the STOP-GO- 
STOP strategy. We shall refer to the problem of studying the average performance of DITSP 
over this class of point sets as stochastic DITSP. In this section, we present novel algorithms 
for stochastic DITSP and then establish bounds on their performances. 

We make the following assumptions: in R^, Q is a rectangle of width W and height H with 
W > H; in M^, Q is a rectangular box of width W, height H and depth D with W > H > D. 
Different choices for the shape of Q affect our conclusions only by a constant. The axes of 
the reference frame are parallel to the sides of Q. The points P = (pi, . . . ,Pn) are randomly 
generated according to a uniform distribution in Q. 

A. Lower bounds 

First we provide lower bounds on the expected length of stochastic DITSP for the 2 and 3 
dimensional case. 

Theorem 3.1: (Lower bounds on stochastic DITSP) For all ryei > and r^tr > 0, the expected 
cost of a stochastic DITSP visiting a set of n uniformly-randomly-generated points satisfies the 
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following inequalities: 

E[DITSP(P C Q C M^)] 3/6WH\y3 
hm — > - and 

E[DITSP(P CQC M:^)] ^ 5 / 20WHD y/5 

n^+oo ~ 6 V Trrvel^^tr ^ 

Proof: We first prove the first inequality. Choose a random point qi E P as the initial 
position and Vi as the initial speed of the vehicle on the tour, and choose the heading randomly. 
We would like to compute a bound on the expected time to the closest next point in the tour; 
let us call such a time t*. To this purpose, consider the set of points that are reachable by a 
second order vehicle within time t . It can be verified that the area of such a set can be bounded, 
as t 0+, by 

Area(i?,) < + o{t') < '-^^ + oi^). (4) 

o 

Given time t, the probability that t* > t is no less than the probability that there is no other 
target reachable within a time at most t; in other words, 

T3 r+* ^ +1 ^ 1 Area(i?t) r^^r^^it^ . 

rr t > t > 1 — n— ; — - > 1 — n o(t ). 

^ J - Area(Q) " QWH ^ ^ 



In terms of expectation, defining c 



+ 00 



Jo 



-1/3 /.c-l/^ 



3 / 6WH \ 







4 V'^vel'^ctr'T'. 

The expected total tour time will be no smaller than n times the expected shortest time between 
two points, i.e., 

E[DITSP(P)r.e,,re,2] > ^ (^1^^^ _ ^(n^/^). 

4 V '"vel^ctr / 

Dividing both sides by n'^^^ and taking the limit as n ^ +oo, we get the first result. 

We now prove the second inequality. Choose a random point qt E P as the initial position and 
Vi as the initial speed of the vehicle on the tour, and choose the heading randomly. We would 
like to compute a bound on the expected time to the closest next point in the tour; let us call 
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such a time t*. To this purpose, consider the set Rt of points that are reachable by a second 
order vehicle within time t . It can be verified that the volume of such a set can be bounded, as 
t 0+, by 

Volume(i?,) < + o(t^) < ^^^^^ + o(t^). (5) 

Given time t, the probability that t* > t is no less than the probability that there is no other 
target reachable within a time at most t; in other words, 

Pr[r > t] > 1 - n^^^^^ > 1 - - o{t% 

^ ^ - Volume(Q) " 2m HD ^ ' 

2 

In terms of expectation, defining c = ^^^'^'g , 

-+00 



E[t*] = / Pr(r>Oc?e 

^0 



'0 

> max |o, 1 - _ ^(^5) , 

> / [i-ce)di-n o{e)di 

Jo Jo 



'0 JO 

1/5 

-1/5N 



o[n 



5 /20WHD\ 

The expected total tour time will be no smaller than n times the expected shortest time between 
two points, i.e., 

6 V '^vel'^ctr / 

Dividing both sides by n^/^ and taking the limit as n ^ +oo, we get the second result. 



B. Constructive upper bounds 

In this section, we first recall the RECURSIVE Bead-Tiling Algorithm from our earlier 
work [14] on Dubins vehicle and use it to propose novel algorithms for the stochastic DITSP: 
the Recursive Bead-Tiling Algorithm for and Recursive Cylinder-Covering 
Algorithm for The performances of these algorithms will be shown to be within a constant 
factor of the optimal with high probability. 

In [14], we studied stochastic versions of TSP for Dubins vehicle. Though conventionally 
Dubins vehicle is restricted to be a planar vehicle, one can easily generalize the model even 
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for the three (and higher) dimensional case. Correspondingly, Dubins vehicle can be defined as 
a vehicle that is constrained to move with a constant speed along paths of bounded curvature, 
without reversing direction. Accordingly, a feasible curve for Dubins vehicle or a Dubins path is 
defined as a curve that is twice differentiable almost everywhere, and such that the magnitude of 
its curvature is bounded above by 1/p, where p > is the minimum turn radius. Based on this, 
one can immediately come up with the following analogy between feasible curves for Dubins 
vehicle and a double integrator. 

Lemma 3.2: (Trajectories of Dubins and double integrators) A feasible curve for Dubins 
vehicle with minimum turn radius p > is a feasible curve for a double integrator (modeled 
in equation dU)) moving with a constant speed ^pr^. Conversely, a feasible curve for a double 
integrator moving with a constant speed s < r\^i is a feasible curve for Dubins vehicle with 
minimum turn radius — . 

ren- 
in [12], we proposed a novel algorithm, the Recursive Bead-Tiling Algorithm for the 
stochastic version of the Dubins TSP (DTSP) in M?; we showed that this algorithm performed 
within a constant factor of the optimal with high probability. In this paper, taking inspiration 
from those ideas, we propose algorithms to compute feasible curves for a double integrator 
moving with a constant speed. Note that moving at the maximum speed r^ei is not necessarily 
the best strategy since it restricts the maneuvering capability of the vehicle. Nonetheless, this 
strategy leads to efficient algorithms. In what follows we assume that the double integrator is 
moving with some constant speed s < r^^i- Next, we proceed towards devising strategies which 
perform within a constant factor of the optimal for stochastic DITSP in as well as M'^, both 
with high probability. 

1) The basic geometric construction: Here we define useful geometric objects and study their 
properties. Given the constant speed s for the double integrator let p = from Lemma IT2l this 
constant corresponds to the minimum turning radius of the analogous Dubins vehicle. Consider 
two points p_ and p+ on the plane, with i = ||j9+ — < 4p, and construct the bead Bp{i) as 
detailed in Figure [l] 

Associated with the bead is also the rectangle efgh. Rotating this rectangle about the line 
passing through j9_ and j9+ gives rise to a cylinder The regions Bp{£) and Cp{£) enjoy the 

following asymptotic properties as (//p) 0+: 
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Fig. 1. Construction of the "bead" Bp{£). The figure shows how the upper half of the boundary is constructed, the bottom half 
is symmetric. The figure shows the rectangle efgh which is used to construct the "cylinder" Cp{£). 



(PI) The maximum "thickness" of Bp{i) is equal to 



The radius of cross-section of Cp{i) is w;(£)/4 and the length of Cp{i) is i. 
(P2) The area of Bp{i) is equal to 

The volume of Cp{t) is equal to 

(P3) For any 'p E Bp, there is at least one feasible curve 7p through the points 
entirely contained within Bp. The length of any such path is at most 

Length(7p) < 4parcsin ( — =i + p- o[ — ]. 

\4pJ VpV 

Analogously, for any p E Cp, there is at least one feasible curve 7p through the points 
{p-,p,p+}, entirely contained within the region obtained by rotating Bp{i) about the line 
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passing through p_ and p^. The length of 7^ satisfies the same upper bound as the one 
established for 7^. 

The geometric shapes introduced above can be used to cover and in an organized way. 
The plane can be periodically tiled^ by identical copies of Bp{i), for any i g]0, 4p]. The cylinder, 
however does not enjoy any such special property. For our purpose, we consider a particular 
covering of by cylinders described as follows. 



Fig. 2. A typical layer of cylinders formed by stacking rows of cylinders 

A row of cylinders is formed by joining cylinders end to end along their length. A layer of 
cylinders is formed by placing rows of cylinders parallel and on top of each other as shown 
in Figure El For covering M^, these layers are arranged next to each other and with offsets as 
shown in Figure Ofa), where the cross section of this arrangement is shown. We refer to this 
construction as the covering ofM.^. 

2) The 2D case: The RECURSIVE Bead-Tiling Algorithm (RecBTA).- Consider a tiling 
of the plane such that Area[Bp{£)] = Area[Q C R^]/{2n) = WH/{2n); to obtain this equality 
we assume £ to be a decreasing function of n such that i{n) < 4p. Furthermore, we assume the 
tiling is chosen to be aligned with the sides of Q C M^, see Figure IH 

^A tiling of the plane is a collection of sets whose intersection has measure zero and whose union covers the plane. 
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Fig. 3. (a): Cross section of the arrangement of the layers of cylinders used for covering Q C (b): The relative position 
of the bigger cylinder relative to smaller ones of the prior phase during the phase transition. 



The proposed algorithm consists of a sequence of phases; during each of these phases, a 
feasible curve will be constructed that "sweeps" the set Q. In the first phase, a feasible curve is 
constructed with the following properties: 

(i) it visits all non-empty beads once, 

(ii) it visits all rows^ in sequence top-to-down, alternating between left-to-right and right-to-left 
passes, and visiting all non-empty beads in a row, 

(iii) when visiting a non-empty bead, it services at least one target in it. 

In order to visit the outstanding targets, a new phase is initiated. In this phase, instead of 
considering single beads, we will consider "meta-beads" composed of two beads each, as shown 
in Figure m and proceed in a similar way as the first phase, i.e., a feasible curve is constructed 
with the following properties: 

(i) the curve visits all non-empty meta-beads once, 

(ii) it visits all (meta-bead) rows in sequence top-to-down, alternating between left-to-right 
and right-to-left passes, and visiting all non-empty meta-beads in a row, 

(iii) when visiting a non-empty meta-bead, it services at least one target in it. 

'a row is a maximal string of beads with non-empty intersection with Q. 
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This process is iterated at most logg n + 1 times, and at each phase meta-beads composed 
of two neighboring meta-beads from the previous phase are considered; in other words, the 
meta-beads at the i-th phase are composed of 2*^^ neighboring beads. After the last phase, the 
leftover targets will be visited using, for example, a greedy strategy. 



Fig. 4. Sketch of "meta-beads" at successive phases in the recursive bead tiling algorithm. 

The following result is related to a similar result in [15]. 

Theorem 3.3 (Targets remaining after recursive phases): Let P EVn^^ uniformly randomly 
generated in Q E M?. The number of unvisited targets after the last recursive phase of the 
RecBTA is less than 24 log2 n with high probability, i.e., with probability approaching one as 

n -foo. 

Proof: Associate a unique identifier to each bead, let b(t) be the identifier of the bead in 
which the target is sampled, and let h{t) E N be the phase at which the target is visited. 
Without loss of generality, assume that targets within a single bead are visited in the same order 
in which they are generated, i.e., if b{ti) = 6(^2) and ti < t2, then h{ti) < h{t2). Let Vi{t) be the 
number of beads that contain unvisited targets at the inception of the phase, computed after 
the insertion of the target. Furthermore, let be the number of z"^ phase meta-beads (i.e., 
meta-beads containing 2*^^ neighboring beads) with a non-empty intersection with Q. Clearly, 
Vi{t) < Vi{n), nii < 2mj+i, and vi{n) < n < mi/2 with certainty. The target will not be 
visited during the first phase if it is sampled in a bead that already contains other targets. In 
other words, 

P, > 21 . !^ < ^ < i. 

^ mi In 1 

Similarly, the t"^ target will not be visited during the phase if (i) it has not been visited before 
the pass, and (ii) it belongs to a meta-bead that already contains other targets not visited 
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before the phase: 

Pr [hit) >i+l\ iviit-l),v^.l{t-l),Vlit-l 

= Pr [h{t) > i + 1| h{t) > i,v,{t - 1)] ■ Pr [h{t) > i\ {vi-^i{t - 1), . . .,Vi{t - 1 
< ^^^^tl^l p^ihit) > z\ (i;,„i(t -l),...,v,{t- 1))] 



rrii 



n 



Vj{t-1) ^2^-\{n) 



mi 



<n 



2ri 



2^ 



Given a sequence {AjieN C M+ and given a fixed i > 1, define a sequence of binary random 
variables 

1, if hit) > z + 1 and Vi{t - 1) < A^, 
0, otherwise. 

In other words, 1^ = 1 if the target is not visited during the first i phases even though the 
number of beads still containing unvisited targets at the inception of the phase is less than 
PiU. Even though the random variable Yf depends on the targets generated before the target, 
the probability that it takes the value 1 is bounded by 



Vi[Y^ = 1| 6(1), 6(2), . . . , 6(t - 1)] < 2^ n 

regardless of the actual values of 6(1), . . . , 6(t — 1). It is known [15] that if the random variables 
Yt satisfy such a condition, the sum Yt is stochastically dominated by a binomially distributed 
random variable, namely. 



Pr 



Y,Y,>k 



t=i 



< Fi[B{n,qi) > k]. 



In particular. 



Pr 



J2Yt> 2nq, 



t=i 



< Pr[5(n, Qi) > 2npi] < 2""'?'/^ 



(6) 



where the last inequality follows from Chemoff's Bound [16]. Now, it is convenient to define 

{AjisN by 

A = 1, A+i = 2g. = 2-^+^ I I = T--' 



2q. = 2^^'llP, 
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which leads to Pi = 2^ \ In turn, this implies that equation Q can be rewritten as 



Pr 



t=i 



< 



which is less than for i < i*{n) := [log2?^ — log2log2n — log2 6J < log2?^. Note that 
A < 12 for all i > i*{n). 

Let £i be the event that Vi{n) < jSiU. Note that if Si is true, then Vi+i{n) < Ym^=i ^t" the right 
hand side represents the number of targets that will be visited after the i'*^ phase, whereas the 
left hand side counts the number of beads containing such targets. We have, for all i < i*{n): 



Pr 



that is, PiK.+il S^] < 



Vi+i > Si ■ PT[Si] < Pr 

1 



in 



n2 FilSi] 



t=i 

and thus (recall that Si is true with certainty): 



1 



Pr h^i+i] < ^ + Pr[-^,] < ^. 



In other words, for all i < i*{n), Vi{n) < Piji with high probability. 

Let us now turn our attention to the phases such that i > i*{n). The total number of targets 
visited after the phase is dominated by a binomial variable B{n, 12 logg n/n); in particular. 



Pr 



n 

Vi.+i > 241og2n| Si,j ■ Pt[S,,] < Pi [J^F* > 241og2n 

t=i 



< Pr [B{n, 12 log^n/n) > 24 logs n] < 2"^2'°S2". 



Dealing with conditioning as before, we obtain 



Pr [tv+i > 24 log2 n]<— + Fi[^Si,] < — ^ 



In other words, the number of targets that are left unvisited after the (z*)* phase is bounded by 
a logarithmic function of n with high probability. ■ 
In summary. Theorem 13.31 says that after a sufficiently large number of phases, almost all 
targets will be visited, with high probability. The second key point is to recognize that (i) the 
length of the first phase is of order ri^^^ and (ii) the length of each phase is decreasing at 
such a rate that the sum of the lengths of the [log2 n] recursive phases remains bounded and 
proportional to the length of the first phase. (Since we are considering the asymptotic case in 
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which the number of targets is very large, the length of the beads will be very small; in the 
remainder of this section we will tacitly consider the asymptotic behavior as £/p — O'^.) 

Lemma 3.4 (Path length for the first phase): Consider a tiling of the plane with beads of 
length i. For any p > and for any set of target points, the length Li of a path visiting 
once and only once each bead with a non-empty intersection with a rectangle Q of width W 
and length H satisfies 



< 1 + o^r 



3 

Proof: A path visiting each bead once can be constructed by a sequence of passes, during 
which all beads in a row are visited in a left-to-right or right-to-left order. In each row, there are 
at most \W/i~\ + 1 beads with a non-empty intersection with Q. Hence, the cost of each pass 
is at most: 

^pass ^^^^2e + p-o(^ 

Two passes are connected by a U-turn maneuver, in which the direction of travel is reversed, 
and the path moves to the next row, at distance equal to one half the width of a bead. The length 
of the shortest path to reverse the heading of the vehicle with co-located initial and final points 
is (7/3)7rp, the length of the U-turn satisfies 

The total number of passes, i.e., the total number of rows of beads with non-empty intersection 
with Q, satisfies 



16pH fp 



' 2H ■ 

A simple upper bound on the cost of closing the tour is given by 

^ciosc < + 2£) + {H + 2w{i)) + 27rp = W + H + 27ip + 2i + p ■ o{i/p). 
In summary, the total length of the path followed during the first phase is 

< (^l^ + 2 + o(0^ (^i^ + 2£+Ivrp+^ + p-o(^^^ +iy + i/ + 27rp + 2£ + p-o(£/p) 
IGpWH f,^7 p\ (p 
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Based on the calculation for the first phase, we can estimate the length of the paths in generic 
phases of the algorithm. Since the total number of phases in the algorithm depends on the number 
of targets n, as does the length of the beads i, we will retain explicitly the dependency on the 
phase number. 

Lemma 3.5 (Path length at odd-numbered phases): Consider a tiling of the plane with beads 
of length i. For any p > and for any set of target points, the length Iv2j-i of a path visiting 
once and only once each meta-bead with a non-empty intersection with a rectangle Q of width 
W and length H at phase number (2j — 1), j G N satisfies 



pWH 



77rp\ fp 



32—— + p ■ o 



2^ 



3^ + p ■ o 



i 

P/i 



Proof: During odd-numbered phases, the number of beads in a meta-bead is a perfect 
square and the considerations made in the proof of Lemma 13.41 can be readily adapted. The 
length of each pass satisfies 



LP|!^\ <{W + 2H) 



1 



The length of each U-tum maneuver is bounded as 

7 o ,7 



e re 



6p 



p" 



from which 



U— turn 
2j-l 



w 



7 



-vrp + o 



2^ 



+ p - o 



The number of passes satisfies: 



pH 



o - 



2. 



Finally, the cost of closing the tour is bounded by 



Lt^-x <W + H + 2Tip + 2^[l + p- o(£/p)] 



Therefore, a bound on the total length of the path is 



L 



U— turn\ 
2j~l ) 



+ L 



close 

2i-i 



pWH 



3W 



3i + p-o\^- 
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Lemma 3.6 (Path length at even-numbered phases): Consider a tiling of the plane with beads 
of length ^. For any p > 0, a rectangle Q of width W and length H and any set of target points, 
paths in each phase of the Bead-Tiling Algorithm can be chosen such that < 2L2j+i, 
for all j e N. 

Proof: Consider a generic meta-bead i?2j+i traversed in the (2j + 1)*'^' phase, and let l-^ 
be the length of the path segment within i?2j+i- The same meta-bead is traversed at most twice 
during the {2jY^ phase; let li, I2 be the lengths of the two path segments of the (2j)"^ phase 
within i?2i+i- By convention, for i E {1,2,3}, we let = if the i^^ path does not intersect 
i?2j+i- Without loss of generality, the order of target points can be chosen in such a way that 
h < h < h, and hence /1 + /2 < 2/3. Repeating the same argument for all non-empty meta-beads, 
we prove the claim. ■ 

Finally, we can summarize these intermediate bounds into the main result of this section. 
We let Lrbta,p(-P) denote the length of the path computed by the Recursive Bead-Tiling 
Algorithm for a point set P. 

Theorem 3.7 (Path length for the RECURSIVE Bead-Tiling Algorithm).- Let P g Vn be 
uniformly randomly generated in the rectangle of width W and height H. For any p > 0, with 
high probability 

n— >+oo n / n ' \ 

Proof: For simplicity we let Lrbta,p(^) = ^rbta- Clearly, Lrbta = ^'rbta + ^rbta' 
where L'^^j,^ is the path length of the first [log2 n] phases of the Recursive Bead-Tiling Algorithm 
and Lgrp^ is the length of the path required to visit all remaining targets. An immediate conse- 
quence of Lemma 13.61 is that 

riog2(n)l riog2('^)/2l 

-^RBTA = - 3 ^ 

i=l j=l 

The summation on the right hand side of this equation can be expanded using Lemma 13.51 



— IT 

3 W 
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yielding 



-^RBTA — 3 



pWH 



7np\ (p^ 



32f + p-o(^ 



riog2(n)/2l 



\0g2n 



riog2(n)/2l 

+ [M + p-o{i/p)] 2^' 



Since Ej=i 2"-' < Ej^ 2"^ = 1, and Ej=i 2^ = 2*^+1 - 2 < 2*^+1, the previous equation can 
be simplified to 



-f'RBTA < 3 32 



pWH 



rvrp 
3M/ 



+ ( 32^ + P-0 



log2n' 



+ [3i + p- o{i/p)] ■ (4v/^) 



Recalling that i = 2{pWH/n 



,1/3 



^{n ^/^) for large the above can be rewritten as 



+ o(n2/3). 



Now it suffices to show that -^^rbta negligible with respect to -^^rbta large n with high 
probability. From Theorem 13.31 we know that with high probability there will be at most 
241og2n unvisited targets after the [logan] recursive phases. From [13] we know that, with 
high probability, the length of a ALTERNATING ALGORITHM tour through these points satisfies 

^RBTA < '^[12 log2 nlvrp + o(log2 n) . 



In order to obtain an upper bound on the DITSP(P), we derive the expression for time taken, 
TjiecBTAj by the RecBTA to execute the path of length Lp{BTA.p(^) and then optimize it with 
respect to p. Based on this calculation, we get the following result. 

Theorem 3.8: (Upper bound on the total time in M?) Let P G P„ be uniformly randomly 
generated in the rectangle of width W and height H. For any double integrator ([T]), with high 
probability 



lim < 24 



V 3W 

Remark 3.9: Theorems 13.11 and 13.81 imply that, with high probability, the RecBTA is a 
( 1 + g^Tj^j -factor approximation (with respect to n) to the optimal stochastic DITSP in 
and that E[DITSP(P cQc M^)] belongs to Qin"^/^). 
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3) The 3D case: The Recursive Cylinder-Covering Algorithm (REcCCAj; Con- 
sider a covering of Q G by cylinders such that Volume[Cp(£)] = Volume[Q C M'^]/(4n) = 
WHD/{An) (Again implying that n is sufficiently large). Furthermore, the covering is chosen 
in such a way that it is aligned with the sides of Q C M'^. 

The proposed algorithm will consist of a sequence of phases; each phase will consist of five 
sub-phases, all similar in nature. For the first sub-phase of the first phase, a feasible curve is 
constructed with the following properties: 

(i) it visits all non-empty cylinders once, 

(ii) it visits all rows of cylinders in a layer in sequence top-to-down in a layer, alternating 
between left-to-right and right-to-left passes, and visiting all non-empty cylinders in a row, 

(iii) it visits all layers in sequence from one end of the region to the other, 

(iv) when visiting a non-empty cylinder, it services at least one target in it. 




Fig. 5. Starting from top left in the left-to-right, top-to bottom direction, sketch of projection of "meta-cylinders" on the 
corresponding side of Q C R'^ at second, third, fourth and fifth sub-phases of a phase in the recursive cylinder covering 
algorithm. 

In subsequent sub-phases, instead of considering single cylinders, we will consider "meta- 
cylinders" composed of 2, 4, 8 and 16 beads each for the remaining four sub-phases, as shown 
in Figure|5l and proceed in a similar way as the first sub-phase, i.e., a feasible curve is constructed 
with the following properties: 

(i) the curve visits all non-empty meta-cylinders once, 

(ii) it visits all (meta-cylinder) rows in sequence top-to-down in a (meta-cylinder) layer, 
alternating between left-to-right and right-to-left passes, and visiting all non-empty meta- 
cylinders in a row, 
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(iii) it visits all (meta-cylinder) layers in sequence from one end of the region to the other, 

(iv) when visiting a non-empty meta-cylinder, it services at least one target in it. 

A meta-cylinder at the end of the fifth sub-phase, and hence at the end of the first phase will 
consist of 16 nearby cylinders. After this phase, the transitioning to the next phase will involve 
enlarging the cylinder to 32 times its current size by increasing the radius of its cross section by 
a factor of 4 and doubling its length as outlined in Figure |3tb). It is easy to see that this bigger 
cylinder will contain the union of 32 nearby smaller cylinders. In other words, we are forming 
the object Cp{2i) using a conglomeration of 32 Cp{i) objects. This whole process is repeated at 
most log2 n + 2 times. After the last phase, the leftover targets will be visited using, for example, 
a greedy strategy. 

We have the following results, which are similar to the one for the RECURSIVE Bead-Tiling 
Algorithm. 

Theorem 3.10 (Targets remaining after recursive phases): Let P &Vn^Q uniformly randomly 
generated in Q C M'^. The number of unvisited targets after the last recursive phase of the 
Recursive Cylinder-Covering Algorithm over P is less than 24 log2 n with high prob- 
ability, i.e., with probability approaching one as n — > +oo. 

Lemma 3.11 (Path length for the first sub phase): Consider a covering of the space with cylin- 
ders Cp{i). For any p > and for any set of target points, the length Lj of a path executing the 
first sub-phase of the Recursive Cylinder-Covering Algorithm in a rectangular box Q 
of width W, height H and depth D satisfies 



Proof: A path visiting each cylinder once can be constructed by a sequence of passes, 
during which all cylinders in a row are visited by making left-to-right and then right-to-left 
passes. This is done for all the rows of cylinders. In each row, there are at most \W/f\ + 1 
cylinders encountered in one pass. Hence, the cost of each pass is at most: 



In order to visit all cylinders in a row, the vehicle needs to make two passes through that row 





and the paths for these two passes are connected by a u-turn path whose length is ^np + |. 
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Therefore the length of the path required to visit all cylinders in one row is: 

9 7 ( 

During the transition from one row to another, the vehicle needs to make a U-tum maneuver, in 
which the direction of travel is reversed, and the path moves to the next row, at distance equal 
to the diameter of the cylinder. Since the length of the shortest path to reverse the heading of 

the vehicle with co-located initial and final points is (7/3)7rp, the length of the U-turn satisfies 

7 1 7 /2 f 

LT'''^'' < ^vrp + -w{i) <is^P + T^ + p-o{-A. (7) 



3 2 ' ' - 3 16p \p\ 
The total number of rows, i.e., the total number of rows of cylinders with non-empty intersection 

with Q, satisfies 

Nr<\^]+i<^-^+o(p). 

' - w{i) - ^ \iJ 

During the transition from one row to another, the vehicle needs to make a U-tum maneuver 

whose length satisfies the same bound as in Eq. Q The total number of layers of cylinders 

satisfies 



flayer ^ 



AD 



T < 32pD ^ 



A simple upper bound on the cost of closing the tour is given by 

^ciose < (y^ + 2i) + {H + 2w{i)) + {D + w{£)) + 2Tip = W + H + D + 2Tip + 21 + p ■ o{l/p). 

In summary, the total length of the path followed during the first sub-phase is 

Li < nY'"''" {NY"^ {L'r + i^y"*"™) + Ly-*"™) + L^'°''" 
102Ap^WHD f 7 p\ fp^ 

Based on this calculation, we can estimate the length of the paths in subsequent sub-phases. 

1024pWM / 7 p \ /p3 

^-^^^ (^ + 34j+^-HF 

512pWM/ 7 p\ /p3 
^ni< J, ^1 + 3^^;+/^-^^ 

^ ^ ^Vlp'WHD A 7 p \ /p3\ 
256p'^WHD A 7 p \ fp^\ 
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The length of path to execute the first phase is then the some of the path lengths for these 
five sub-phases. 

Lemma 3.12 (Path length at the first phase): Consider a covering of the space with cylinders 
Cp{£). For any p > and for any set of target points, the length Li of a path visiting once and 
only once each cylinder with a non-empty intersection with a rectangular box Q of width W, 
height H and depth D satisfies 

3328pWM/ 7 p\ (p^ 

— ^ — V^rw)^p-'\T^ 

Since we increase the length of cylinders by a factor of two while doing the phase transtion 
from one phase to the another, the length of path for the subsequent phase is given by: 

Finally, we can summarize these intermediate bounds into the main result of this section. We let 
Lrcfa,p(-P) denote the length of the path computed by the RECURSIVE Cylinder-Covering 
Algorithm for a point set P. 

Theorem 3.13 (Path length for the RECURSIVE Cylinder-Covering Algorithm).- LetP e 
Vn be uniformly randomly generated in the rectangle of width W, height H and depth D. For 
any p > 0, with high probability 

DITSP(PC Qcg) ^ ^ m(^)"\,^wHDY,^, 



Proof: Clearly, 



|-log[2]n + 7-| 



4 = 1 



3328p'^WHD / 7 p\ f p^ 



\ / dOZ,Op VV II I p 

lkcfa= [ — — v^rw 



53248pWiJD f 7 p\ fp^ 

- — — v^rw)^p-''[e 



Recalling that i = 2 ^ ^^^ j + ^l"^ ^^^) large n, the above can be rewritten as 

■ 

Theorem 3.14: (Upper bound on the total time in W?) Let P G be uniformly randomly 
generated in the rectangular box of width W, height H and depth D. For any double integrator 
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([T]), with high probability 

Remark 3.15: Theorems 13.11 and 13.141 imply that, with high probability, the RecCCA is a 
1 + ^ 1 -factor approximation (with respect to n) to the optimal stochastic DITSP in W 
and that E[DITSP(P C Q C M^)] belongs to Q{n^/^). 

IV. The DTRP for double integrator 

We now turn our attention to the Dynamic Traveling Repairperson Problem (DTRP) that was 
introduced in [6] and that we here tackle for a double integrator. 

A. Model and problem statement 

In the DTRP the double integrator is required to visit a dynamically growing set of targets, 
generated by some stochastic process. We assume that the double integrator has unlimited range 
and target- servicing capacity and that it moves at a unit speed with minimum turning radius 
p > 0. 

Information about the outstanding targets representing the demand at time t is described by 
a finite set n{t) of positions V{t). Targets are generated, and inserted into V, according to a 
time-invariant spatio-temporal Poisson process, with time intensity A > 0, and uniform spatial 
density inside the region Q, which we continue to assume to be a rectangle for two dimensions 
and a rectangular box for three dimensions. Servicing of a target and its removal from the set 
V, is achieved when the double integrator moves to the target position. A control policy $ for 
the DTRP assigns a control input to the vehicle as a function of its configuration and of the 
current outstanding targets. The policy $ is a stable policy for the DTRP if, under its action 

n$ = lim ^[n{t) \ p = P)] < +oo, 

4— >+oo 

that is, if the double integrator is able to service targets at a rate that is, on average, at least as 
fast as the rate at which new targets are generated. 

Let Tj be the time elapsed from the time the target is generated to the time it is serviced 
and let T$ := limj^+oo E[Tj] be the steady-state system time for the DTRP under the policy $. 
(Note that if the system is stable, then it is known [17] that = AT$.) Clearly, our objective 
is to design a policy $ with minimal system time r$. 
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B. Lower and constructive upper bounds 

In what follows, we design control policies that provide constant-factor approximation of the 
optimal achievable performance. Consistently with the theme of the paper, we consider the case 
of heavy load, i.e., the problem as the time intensity A +00. We first provide lower bounds 
for the system time, and then present novel approximation algorithms providing upper bound on 
the performance. 

Theorem 4.1 (Lower bound on the DTRP system time): For a double integrator ([T]), the sys- 
tem time Tdtrp,2 and Tdtrp.s for the DTRP in two and three dimensions satisfy 

Tdtrp,2 ^ 81 W Tdtrp.s ^ 7813W^iro 
lim — — > , lim — — > 5-. 

A^oo A"' 32 Tveirctr A^oo A* 972 rveir4- 

Proof: We prove the lower bound on Tdtrp,2; the bound on Tdtrp,3 follows on similar lines. 
Let us assume that a stabilizing policy is available. In such a case, the number of outstanding 
targets approaches a finite steady-state value, n* , related to the system time by Little's formula, 
i.e., n* = ATdtrp,2- In order for the policy to be stabilizing, the time needed, on average, to 
service m targets must be no greater than the average time interval in which m new targets are 
generated. The average tim needed by the double integrator to service one target is no gretaer 
than the expected minimum time from an arbitrarily placed vehicle to the closest target; in other 
words, we can write the stability condition E[t*(n*)] < 1/A. A bound on the expected value of 
t* has been computed in the proof of Theorem 13.11 yielding 

3 / QWH \ 

' ^ < E[f{n*)] < 1/A. 



4 V'^vel'^ctr'^. 

Using Little's formula n* = ATdtrp,2, and rearranging, we get the desired result. ■ 
We now propose simple strategies, the Bead Tiling Algorithm (for R^) and the Cylin- 
der Covering Algorithm (for M^), based on the concepts introduced in the previous section. 
The Bead Tiling Algorithm (BTA) strategy consists of the following steps: 

(i) Tile the plane with beads of length £ := minjCeTA/A, 4p}, where 

Cbta = 0.5241rvei (^1 + . (8) 

(ii) Traverse all non-empty beads once, visiting one target per non-empty bead. Repeat this 
step. 



February 1, 2008 



DRAFT 



26 



CccA = 0.1615rvei 1 



The Cylinder Covering Algorithm (CCA) strategy is akin to the BTA, where the region 
is covered with cylinders constructed from beads of length i := min{CcFA/A, 4p}, where 

The policy is then to traverse all non-empty cylinders once, visiting one target per non-empty 
cylinder. The following result characterizes the system time for the closed loop system induced 
by these algorithms and is based on the bounds derived to arrive at Theorems 13.81 and 13.141 

Theorem 4.2 (Upper bound on the DTRP system time): For a double integrator ([T]) and A > 
0, the BTA and the CCA are stable policies for the DTRP and the resulting system times Tbxa 
and TcFA satisfy: 

^- Ti/w / 77rr2^j 



lim ^^f^ < lim ^ < 70.5^ ( 1 

A^oo A"' A^oo ^"vel^'ctr V 



5 



^DTRP 3 , TcFA jWHD f 77rp\' 

A-oo A^ - A-.00 A4 - rveir,\ \ 3W J 

Proof: We prove the upper bound on Tdtrp,2; the upper bound on Tdtrp.s follows on 

similar lines. Consider a generic bead B, with non-empty intersection with Q. Target points 

within B will be generated according to a Poisson process with rate A^ satisfying 

_ Area(i? n Q) ^ Area(i?) _ C^ta ( 1 

^ WH - WH IQpWHX^ V^2 

The vehicle will visit B at least once every Trecbta,i time units, where Trecbta.i is the bound on 

the time required to traverse a path of length Li, as computed in Lemma EH As a consequence, 

targets in B will be visited at a rate no smaller than 

In summary, the expected time Tb between the appearance of a target in B and its servicing by 
the vehicle is no more than the system time in a queue with Poisson arrivals at rate A^, and 
deterministic service rate pb- Such a queue is called a M/D/1 queue in the literature [17], and 
its system time is known to be 

J^M/D/1 = J- + ^ 

flB \ 2pB- ^B, 

Using the computed bounds on Ab and pb, and taking the limit as A ^ +oo, we obtain 

,. Tb ^ Tm/d/1 ^ IQpWH / 1 Cbta 
lim — < lim — f-— < -r 1 H --. 



(9) 
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Since equation ^ holds for any bead intersecting Q, the bound derived for Tg holds for all 
targets and is therefore a bound on Tdtrp,2- The expression on the right hand side of ^ is 
a constant that depends on problem parameters p, W, and H, and on the design parameter 
CbtAj as defined in equation (El). Stability of the queue is established by noting that Cbta < 
i^veii^ + 7/3 TT p/Wy^. Additionally, the choice of Cbta in equation (EJl minimizes the right 
hand side of Q yielding the numerical bound in the statement. We then substitute p = r'^^Jrcb 
to yield the final result. ■ 
Remark 4.3: Note that the achievable performances of the BTA and the CCA provide a 
constant-factor approximation to the lower bounds established in Theorem 14.11 

V. Extension to the TSPs for the Dubins vehicle 

In our earlier works [13], [18], [12], we have studied the TSP for the Dubins vehicle in the 
planar case. In [13], we proved that in the worst case, the time taken to complete a TSP tour 
by the Dubins vehicle will belong to 6(n). One could shown that this result holds true even in 
M'^. In [12], the first known algorithm with strictly sublinear asymptotic minimum time for tour 
traversal was proposed for the stochastic DTSP in M^. This algorithm was modified in [12] to 
give a constant factor approximation to the optimal with high probability. This naturally lead 
to a stable policy for the DTRP problem for the Dubins vehicle in which also performed 
within a constant factor of the optimal with high probability. The RecCCA developed in this 
paper can naturally be extended to apply to the stochastic DTSP in K^. It follows directly from 
Lemma IJ!2l that in order to use the RecCCA for a Dubins vehicle with minimum turning radius 
p, one has to simply compute feasible curves for double integrator moving with a constant speed 
y/pr ctr- Hence the results stated in Theorem 13 . 1 41 and Theorem 14.21 also hold true for the Dubins 
vehicle. 

This equivalence between trajectories makes the RecCCA the first known strategy with a 
strictly sublinear asymptotic minimum time for tour traversal for stochastic DTSP in M^. The 
fact that it performs within a constant factor of the optimal with high probability and that it 
gives rise to a constant factor approximation and stabilizing policy for DTRP for Dubins vehicle 
in M'^ is also novel. 
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VI. Conclusions 

In this paper we have proposed novel algorithms for various TSP problems for vehicles with 
double integrator dynamics. Future directions of research include extensive simulations to support 
the results obtained in this paper, study of centralized and decentralized versions of the DTRP, and 
more general task assignment and surveillance problems for vehicles with nonlinear dynamics. 
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